Basic Statistics

Raw Counts

Name                    Value
Rows                    81,115
Columns                 15
Discrete columns        13
Continuous columns      2
All missing columns     0
Missing observations    90,159
Complete rows           22,007
Total observations      1,216,725
Memory allocation       44.7 Mb
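
The raw counts above are enough to work out the matching percentages by hand. The sketch below is illustrative only: the variable names are made up, and the choice of denominators (columns over total columns, missing observations over total observations, complete rows over rows) is an assumption about what a percentage view of this table would show.

```python
# Raw counts copied from the table above.
rows = 81_115
columns = 15
discrete_columns = 13
continuous_columns = 2
all_missing_columns = 0
missing_observations = 90_159
complete_rows = 22_007

# Total observations is simply rows x columns (1,216,725 in the table).
total_observations = rows * columns


def pct(part, whole):
    """Return part/whole as a percentage rounded to two decimals."""
    return round(100 * part / whole, 2)


# Assumed denominators: column counts over all columns, missing
# observations over all observations, complete rows over all rows.
percentages = {
    "Discrete columns": pct(discrete_columns, columns),
    "Continuous columns": pct(continuous_columns, columns),
    "All missing columns": pct(all_missing_columns, columns),
    "Missing observations": pct(missing_observations, total_observations),
    "Complete rows": pct(complete_rows, rows),
}

for name, value in percentages.items():
    print(f"{name:<22}{value:>6.2f}%")
```

Under these assumptions roughly 7.4% of all cells are missing, yet only about 27% of rows are complete, which suggests the missing values are spread across many rows rather than concentrated in a few.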

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (with frequency)

## 8 columns ignored with more than 50 categories.
## description: 81090 categories
## designation: 29512 categories
## province: 367 categories
## region_1: 1050 categories
## title: 80462 categories
## variety: 708 categories
## vintage: 56 categories
## winery: 12833 categories
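
The message above reflects a cardinality cutoff: discrete columns with more than 50 distinct values are left out of the bar charts. A minimal sketch of that kind of filter follows. The helper `split_by_cardinality`, its `maxcat` parameter name, and the toy data are hypothetical and are not taken from the tool that produced this report.

```python
def split_by_cardinality(table, maxcat=50):
    """Split columns into (kept, ignored) by their number of distinct values.

    `table` maps column name -> list of values. A column is ignored when it
    has strictly more than `maxcat` distinct values.
    """
    kept, ignored = {}, {}
    for name, values in table.items():
        n_categories = len(set(values))
        target = ignored if n_categories > maxcat else kept
        target[name] = n_categories
    return kept, ignored


# Toy data, not the wine data set: a low-cardinality and a high-cardinality column.
toy = {
    "country": ["US", "France", "US", "Italy"],
    "title": [f"Wine {i}" for i in range(4)],
}
kept, ignored = split_by_cardinality(toy, maxcat=3)

print(f"## {len(ignored)} columns ignored with more than 3 categories.")
for name, n_categories in ignored.items():
    print(f"## {name}: {n_categories} categories")
```

With a cutoff of 50, every column listed above (from `vintage` at 56 categories up to `description` at 81,090) would fall on the ignored side.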

QQ Plot

## Warning: Removed 70 rows containing non-finite values (`stat_qq()`).
## Warning: Removed 70 rows containing non-finite values (`stat_qq_line()`).

Correlation Analysis

## 7 features with more than 20 categories ignored!
## description: 22002 categories
## designation: 10138 categories
## region_1: 207 categories
## title: 21941 categories
## variety: 190 categories
## vintage: 21 categories
## winery: 2834 categories
## Warning in cor(x = structure(list(points = c(86, 86, 88, 88, 88, 88, 94, : the standard
## deviation is zero

Principal Component Analysis

## 6 features with more than 50 categories ignored!
## description: 22002 categories
## designation: 10138 categories
## region_1: 207 categories
## title: 21941 categories
## variety: 190 categories
## winery: 2834 categories
## Warning in plot_prcomp(data = structure(list(country = c("US", "US", "US", : The following features are dropped due to zero variance:
##  * country_US
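
Both the "standard deviation is zero" warning and the dropped `country_US` feature point to a constant column: an indicator that takes the same value in every row has zero variance, so it carries no information for PCA and its correlation with anything else is undefined. The sketch below illustrates this with toy values. The helpers `one_hot` and `pearson` are hypothetical, and the assumption that every row in the analysed subset has country "US" is an inference from the warning text, not something the report states directly.

```python
from math import sqrt
from statistics import pvariance


def one_hot(values):
    """Return {level: 0/1 indicator list} for each distinct level."""
    return {
        level: [1 if v == level else 0 for v in values]
        for level in sorted(set(values))
    }


def pearson(x, y):
    """Pearson correlation, or None when either input is constant."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return None  # division by a zero standard deviation: undefined
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return sxy / sqrt(sxx * syy)


# Toy stand-in for the analysed rows (assumed to be all "US").
country = ["US", "US", "US", "US"]
points = [86, 86, 88, 88]

indicators = one_hot(country)  # a single, constant indicator column
zero_variance = [
    f"country_{level}"
    for level, column in indicators.items()
    if pvariance(column) == 0
]

print("Dropped due to zero variance:", zero_variance)
print("cor(points, country_US):", pearson(points, indicators["US"]))
```

If the constant column is an artifact of restricting the analysis to complete rows, checking which columns drive the missingness (see the Missing Data Profile section) would be the natural next step.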